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We investigate the impact of theoretical uncertainties on the accuracy of measurements involving 
hadronic jets. The analysis is performed using events with a Z boson and a single jet observed 
in pp collisions at yfs = 1.96 TeV in 4.6 fb _1 of data from the Collider Detector at Fermilab 
(CDF). The transverse momenta (pt) of the jet and the boson should balance each other due 
to momentum conservation in the plane transverse to the direction of the p and p beams. We 
evaluate the dependence of the measured px-balance on theoretical uncertainties associated with 
initial and final state radiation, choice of renormalization and factorization scales, parton distribution 
functions, jet-parton matching, calculations of matrix elements, and parton showering. We find 
that the uncertainty caused by parton showering at large angles is the largest amongst the listed 
uncertainties. The proposed method can be re-applied at the LHC experiments to investigate and 
evaluate the uncertainties on the predicted jet energies. The distributions produced at the CDF 
environment are intended for comparison to those from modern event generators and new tunes of 
parton showering. 
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I. INTRODUCTION AND PHYSICS MOTIVATION 



The discovery potential of LHC experiments will strongly depend on the accuracy of standard model (SM) predic- 
tions for processes containing hadronic jets [l| , as the first step in establishing "new physics" has to be identifying an 
incontrovertible deviation from SM phenomena. The uncertainties on predictions for SM processes directly impact 
measurements of jet spectra, searches for new heavy particles using jet energies for kinematic reconstruction, and the 
calculation of missing transverse momentum, to name several prevalent analysis strategies as examples. The discovery 
potential for supersymmetry and other models of physics beyond the SM thus relies on having calibrated methods for 
measuring jet energies. 

It has been common practice to normalize the clustered jet energy, measured with calorimeters, to the energy of 
the particle jet or the parent parton [HQ- The correction factor is often called the jet energy scale (JES). At CDF 
jets are observed in non-compensating sampling calorimeters, which have a non-linear response to single particles. 
The simulated calorimeter response for single hadrons is tuned to match that in data The measured jet energy is 
corrected for instrumental effects such as the non-linear response of the calorimeters as well as for parton radiation 
and hadronization effects. The correction for radiation and hadronization effects is independent of the experimental 
setup. In addition, the jet energies are corrected for multiple pp interactions in the same bunch crossing. 

The systematic uncertainties on the JES and the related measurements arise from the accuracy of the detector 
simulation and limitations of the methods used by SM event generators. The event generators, such as PYTHIA and 
ALPGEN [3, [f|, use a simplified modeling of complex SM processes that can be altered by tuning internal parameters. 
The model-dependent aspects we investigate are the following: 

• parton distribution functions (PDFs) of the colliding p and p 

• leading order (LO) matrix elements of tree-level processes such as qq — ¥ Zg and qg — > Zq 

• the parton-jet matching scheme @ 

• final state radiation (FSR) 

• initial state radiation (ISR) 

• the renormalization and factorization scales 

• residual effects due to multiple pp interactions 

• the ability of the leading-log parton showering model to describe radiation at large angles. 
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To perform the analysis we select events with a Z boson and a jet observed in 4.6 fb^ 1 of data from CDF. A Z 
boson is clearly identified as a pair of opposite-sign electrons or muons with an invariant mass close to the Z-boson 
mass. The transverse momentum of the boson is measured with high precision so that the Z+jet sample is ideal for 
the analysis. We use the px-balance in the event, with the mean- value of the ratio pT(jet)/pT(Z) as the observable 
of interest, to test the simulated SM predictions. 

The determination of the jet energy scale used in previously-published CDF analyses was performed with about 300 
pb- 1 of data @j. The overall uncertainty on the JES was compared to the difference in pT-balance between data and 
MC predictions for a photon-jet and a Z-jet samples; the difference in px-balance was calculated in a similar fashion 
to the method used in the current analysis. Having significantly more data (4.6 fb _1 ) we investigate the systematic 
uncertainties affecting measurements of jet energies independently from the previously evaluated uncertainties on the 
JES. Using the additional statistical power, we can disentangle the different effects contributing to the uncertainties 
by correlating the limitations in theoretical predictions with their effect on the pT-balance. 

The outline of the paper is as follows. The CDF II detector is described in Section [TTJ Details for MC event 
generators are presented in Section IIIII Event selection follows in Sections IIVI and [V] We show px-balance in data 
and PYTHIA as a function of pt(Z) in Section fVTl In the same section we further validate LO predictions from 
PYTHIA and ALPGEN by checking that the relative contributions from qg — s> Zq and qq — > Zg diagrams are accurately 
modeled. Computation-related uncertainties such those due to the choice of factorization and renormalization scales 
are evaluated in Sections IVII1 IVIIIi IIX1 IXl and IXII1 We use data to evaluate uncertainties due to mis-modeling of 
parton radiation at large angles and multiple pp interactions in Sections IXII and IXIII1 respectively. The uncertainty 
due to detector simulations is calculated in Section IXIV1 In Section IXVI we summarize the observed uncertainties 
and compare those to the difference in px-balance between data and the MC predictions. We present conclusions in 
Section [XVD 

II. THE CDF II DETECTOR 

The CDF II detector is a cylindrically symmetric spectrometer designed to study pp collisions at the Fermilab 
Tevatron. The detector has been extensively described in the literature Q. Here we briefly describe the detector 
subsystems relevant for the analysis. 

Tracking systems are used to measure the momenta of charged particles, and to trigger on and identify leptons with 
large transverse momentum, pt @- A multi-layer system of silicon strip detectors (SVX) (§], which identifies tracks 
in both the r — <f> and r — z views [Toj . and the central outer tracker (COT) [Hj | are contained in a superconducting 
solenoid that generates a magnetic field of 1.4 T. The COT is a 3.1 m long open-cell drift chamber that makes up 
to 96 measurements along the track of each charged particle in the region |?7| < 1. Sense wires are arranged in 8 
alternating axial and stereo (±2°) super- layers with 12 wires each. For high momentum tracks, the COT pt resolution 
is ct Pt /p t ~ 0.0017 ( GeV/c)" 1 [ll]. 

Segmented calorimeters with towers arranged in a projective geometry, each tower consisting of an electromagnetic 
and a hadronic compartment [H,[ll|, cover the central region, \ij\ < 1 (CEM/CHA), and the forward region [l5j . 
1 < \r)\ < 3.6 (PEM/PHA). In both the central and forward regions, systems with finer spatial resolution are used to 
make profile measurements of electromagnetic showers at shower maximum [l6| for electron identification (the CES 
and PES systems, respectively). Electrons are reconstructed in the CEM with an Et Q resolution of <j(Et)/Et — 
13.5%/V-ET/GeV© 2% QJ] and in the PEM with an E T resolution of <t{E t )/E t ~ 16.0%/^T/GeV © 1% [Hi- 
Jets are identified using a cone clustering algorithm in r\ — (j> space, with cone radius R = y A?y 2 + A0 2 , as a group of 
electromagnetic and hadronic calorimeter towers. The CDF hadronic calorimeters have a steel-scintillator sampling 
design and the electromagnetic calorimeters are built from lead and scintillator. The sampling calorimeters have a 
non- linear response to stable hadrons Q, which carry most of the jet momentum. 

Muons are identified using the central CMU, CMP, and CMX [18|, [l9[ muon systems, which cover the kinematic 
region 77 1 < 1. The CMU system uses four layers of planar drift chambers to detect muons with px > 1.4 GeV/c in 
the central region of \r/\ < 0.6. The CMP system consists of an additional four layers of planar drift chambers located 
behind 0.6 m of steel outside the magnetic return yoke, and detects muons with px > 2.0 GeV/ c. The CMX detects 
muons in the region 0.6 < \rj\ < 1.0 with four to eight layers of drift chambers, depending on the polar angle. 

The beam luminosity is measured using two sets of gas Cherenkov counters, located in the region 3.7 < \r)\ < 4.7. 
The total uncertainty on the luminosity is estimated to be 5.9%, where 4.4% comes from the acceptance and operation 
of the luminosity monitor and 4.0% from the calculation of the inelastic pp cross-section pol ]. 

A 3-level trigger system Q selects events for further analysis offline. The first two levels of triggers consist of 
dedicated fast digital electronics analyzing a subset of the full detector data. The third level, applied to the full data 
from the detector for those events passing the first two levels, consists of a farm of computers that reconstruct the 
data and apply selection criteria for (typically) several hundred distinct trigger paths. 
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III. STANDARD MODEL PREDICTIONS FOR EVENTS WITH A Z BOSON AND JETS 

The standard model expectations for inclusive production of Z bosons are calculated from Monte Carlo simulations 
using pythia and ALPGEN. Events from the two MC generators are processed through the full detector simulation 
to be reconstructed and analyzed like data. 

The datasets for the Z + light jets signatures are produced using v6.216 of pythia in which the px spectrum of 
the Z bosons, pf, has been tuned to CDF Run I data for < pf < 20 GeV/c [2l|, an d which incorporates a tuned 
underlying-event, Tune AW 22]. The event generator was set to inclusive production of Z-bosons with a M(7* /Z) > 
30 GeV/ c cut. Historically, a di-jet sample simulated with pythia was used to determine the JES at CDF; in this 
study we take the Z + jets events from pythia as our default benchmark sample. 

Additional Z + jets samples are produced with v2.10-prime of ALPGEN that has built-in matching of the number 
of jets from parton showering and matrix-element production Q. The exclusive Z + N partons (N=0,..,4) samples 
were combined into one inclusive sample using the corresponding cross-sections provided by ALPGEN. Showering and 
hadronization of jets is done with pythia v6.326, Tune AW (22[. The jet-parton matching is performed at a px of 15 
GeV/c (referred to as the matching scale) using the jetclu clustering algorithm with a radius of R=0.4. 

Production of Z+jet events is performed differently by stand-alone pythia and ALPGEN+pythia. The ALP- 
GEN+PYTHIA calculation begins with the exact matrix elements from ALPGEN for Z+N partons (N=0,..,4), which are 
then interfaced with PYTHIA parton showering. The interface contains a veto algorithm that removes double-counting 
between matrix element and parton shower partons ,6;j . The stand-alone PYTHIA calculation begins with the simplest 
matrix element (Z+0 partons) and adds additional partons from the shower with no need for a veto. However, the 
first parton emission is corrected to reproduce the Z+l parton matrix clement. Thus, any substantial differences 
between the predictions of the two calculations (if they exist) should arise for the second jet. Both event generators 
use the same PDF set, CTEQ5L, and Lund string hadronization model as implemented in Pythia. 

The parton showering (PS) inside the jet cone (see Section IV A[) has been extensively studied [23U2l| and is in good 
agreement with predictions. In addition, the momentum spectra of charged-particle tracks in jets are found to be in 
good agreement with SM predictions Q. 

The large-angle (outside of cone R=0.4) parton radiation is not described well by stand-alone parton showering 
model in pythia; the rate of softer jets collinear to a jet is not described (e.g. see Fig. 3 in [26J, the distribution 
of Ai?(j2, j3) observed in multi-jet events). Radiation of the 3rd jet in multi-jet events is qualitatively equivalent 
to radiation of the 2nd jet in Z+jets events. The same issue affects the invariant mass calculated for a pair of jets, 
energy of a jet, and missing transverse momentum. The problem related to the parton radiation at large angles can 
be addressed by using an exact matrix element (ME) for multi-jet events as is done in ALPGEN+pythia simulation. 
The ME correction is introduced for radiated jets with px above the matching scale (15 GeV/ c); softer radiation is 
produced via the same leading-log parton shower mechanism. 



IV. DESCRIPTION OF DATA SAMPLES 

The analysis uses events that contain either an electron with Ex > 18 GeV or a muon with px > 18 GeV/c selected 
within the central region of the detector, \r/\ < 1, by the trigger system. The electron dataset contains 229M events; 
the muon dataset contains about 65M events. The integrated luminosity of each dataset is 4.6 fb _1 . 



V. EVENT SELECTION 

Both the observed and the simulated events (see Section IIII[) use the same selection criteria to identify electrons, 
muons, Z bosons, and jets. Details of the selection criteria for electrons and muons are provided in„ 



A. Jet identification 

Jets are reconstructed using jetclu, the standard CDF cone-based clustering algorithm, with cone radii of R = 
0.4, 0.7, and 1.0 [27|. The clustering is performed using calorimeter towers with raw (uncorrected) energy above 1 
GeV to form a cluster of at least 3 GeV. To resolve ambiguities with overlapping cones, cones sharing an energy 
fraction of more than 0.75 are merged into a single jet; otherwise the shared towers are assigned to the closest jet. 

The jet energies are corrected for the non-uniformity in rj of the calorimeter response and for multiple pp interactions. 
In this analysis, the leading jet energy is always corrected to the parton level; the jet energy scale is adjusted to relate 
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the measured energy of a simulated jet and the energy of the corresponding parton in di-jet events Q from PYTHIA. 
The jet clustering algorithm is run over calorimeter towers for reconstructed jets, and over stable particles for hadron- 
level jets. The correction factor from the hadron to the parton level is a function of only jet px, and is the same for 
data and predictions. The JES corrections at CDF do not take into account if a jet is initiated by a tree-level quark 
or a gluon. 

Calorimeter clusters that coincide with an identified electron, or photon are removed from the jet collection to avoid 
ambiguities. High-px photons are not rare in hard-scattering events. Identifying photons as jets and then correcting 
them as jets can lead to mis-measured px-balance. To avoid photon misidentifications the event selection requires 
the leading jet to have EM-fraction less than 0.95. The EM-fraction is the fraction of energy of a jet deposited in the 
electromagnetic compartment of the calorimeter in comparison to the total energy of the jet. 

B. Reconstruction of Z + jet events 

Pairs of oppositely-charged electrons and muons are identified as Z-boson candidates if the reconstructed invariant 
mass falls in the mass window from 80 GeV/c 2 to 100 GeV/c 2 . The selection of Z — > ££ events requires two tight 
leptons or a tight and a loose lepton (see q). The two leptons are required to be assigned to the same primary 
vertex, which is required to have a z-coordinate within 60 cm from the center of the CDF detector. Also we remove 
dimuon events where the leading jet overlaps (A R < 0.4) with one of the muons forming a Z boson. Figure [1] shows 
the distributions in invariant mass for electron and muon pairs; the data are in a remarkable agreement with SM 
predictions from PYTHIA and ALPGEN. 




M [GeV/c 2 ] M [GeV/c 2 ] 

(a) (b) 

FIG. 1: The observed (points) and expected (histogram) distributions in the invariant mass of e + e~ (a) and 
(b) lepton pairs. The slight mismatch in data and Monte Carlo in the dimuon spectrum is due to a small 
miscalibration in the standard CDF tracking momentum scale. 

Events are further required to have at least one jet. First, we correct all jet energies for ^-dependent response of 
the calorimeters and for multiple pp interactions; the leading jet px is required to be greater than 8 GeV/ c. An event 
is vetoed if the second jet cluster, the sub-leading jet, has a px of more than 8 GeV/ c. The leading jet is required to 
be in the pseudo-rapidity range of 0.2 < |?ydct.| < 0.8 to avoid cracks in the central calorimeter. We do not apply the 
rj requirement to sub-leading jets; their pseudo-rapidity can be from -2.8 to 2.8. Then the momentum of the leading 
jet is corrected to the parton level as described above (see Sec. IV A|) . The px of the leading jet, px(jetl), and the pV 
of the Z boson, px(Z), are required to be back-to-back: A0(p*x(jetl), Pt(Z)) > 3.0 rad. 

Rarely the leading jet can originate from another pp interaction produced in the same bunch crossing as the Z+jet; 
the overlapping jets bias the px-balance. The number of interactions in each event is estimated via the number 
of primary vertices along the beam line; the number is used for the corresponding JES correction. The multiple 
interactions are mostly minimum bias events that produce relatively soft jets (px < 8 GeV/ c). 

We veto events with two or more primary vertices in which the leading jet is measured to come from a different 
interaction vertex than that of the hard interaction producing the Z-boson. The jet vertex of origin is determined 
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using tracks pointing to the towers in the jet cluster. For each track we take the ^-coordinate of the point on the 
track closest to the beam-line. A mean value of the z-coordinates is calculated to determine the vertex of jet origin. 
Specifically, events are removed if the leading jet has two or more tracks, and the jet vertex is more than 2 cm away 
from the vertex of the lepton pair along the z-axis. The veto has a negligible effect on the px-balance as the leading 
jet px is required to be above 8 GeV/ c. 

We use simulations to evaluate the bias between the measured pt(Z) and its true value separately for Z e + e~ and 
Z — > /i + ^t~ events. The mean value of the observed j>t(Z) is found to be within 0.5% of its mean generated value in 
both samples. The distribution of the observed transverse momentum of Z-bosons, pr(Z), is shown in Fig. [2j 
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FIG. 2: The observed (points) and expected (histogram) distributions in the transverse momentum of lepton pairs 
with invariant mass between 80< M(££) <100 GeV/c . The solid line is for the PYTHIA and the dashed line is for 
the ALPGEN predictions. All other sample selection cuts have been applied. 



The Z-jet system is not a perfect two-body process and the px-balance, pT(jetl)/pT(^), is sensitive to the sur- 
roundings of the jet. Also, the jet energy resolution is rather poor (in comparison to the jet energy) for jets with px of 
about 10 GeV/c. The resolution improves with higher jet energies from approximately 20% for px(jet) = 40 GeV/ c 
to ^12% for PT(jet) = 140 GeV/c. As an illustration that we can approach the ideal two-body system we apply a 
more stringent event selection; the px of sub-leading jets is required to be less than 3 GeV/c. The px-balance for 
the exclusive event selection is shown in Fig. [3] The distribution for events with pt(Z) < 25 GeV/c is asymmetric 
and shifted from 1.0 due to the finite jet energy resolution and the cut-off on the minimum px of the leading jet (see 
Fig. [3] (a)). The distribution in px-balance for events with pt{Z) > 25 GeV/c is nearly symmetric and peaks close 
to 1.0 (see Fig.|3](b)). Consequently we use events with pt(^) > 25 GeV/c to compare data and predictions. In the 
following analysis, however, we do not use the 3 GeV/c cut-off on the sub- leading jet pt as it was used to effectively 
prove the jet energy correction works properly, but relax the cut-off to 8 GeV/c to better study the systematic 
uncertainties caused by the MC predictions. 



VI. VALIDATION OF SM SIMULATIONS: PROPERTIES OF QUARK AND GLUON JETS 

Properties of a QCD jet depend on the tree-level parton initiating it. A jet initiated by a gluon has a higher 
multiplicity of daughter hadrons than a jet of the same energy initiated by a light quark. The difference in the 
observed particle multiplicities is due to the different color charges of quarks and gluons [28| |. 

A quark jet deposits more energy in the calorimeter system on average than a gluon jet with the same true 
momentum. The difference is caused by the non-linear response of the calorimeter to single particles and the different 
multiplicities of hadrons. The predicted px-balances are presented as a function of pt{Z) for quark and gluon jets 
and for data in Figs. HI [5j and |6] for jet cone radii of 0.4, 0.7, and 1.0. The pT-balance for quark jets is significantly 
different than that for gluon jets; it is consequently essential to check that the mixture of quark and gluon jets is 
predicted accurately by pythia. 

The fraction of quark and gluon jets in the Z-jet sample is largely driven by the parton distribution functions of 
the colliding particles (e.g. pp at the Tevatron and pp at the LHC) and the matrix elements of qq — > Zg and qg — > Zq 
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FIG. 3: The observed (points) and expected (histogram) distributions in p-p-balance, pT(jetl)/pT(-Z), for events 
with pt(^) less than 25 GeV/ c (a) and greater than 25 GeV/ c (b). Sub- leading jets are required to have px < 
3 GeV/c to suppress energy flow outside of the cone of the leading jet (see Section fVA[) . The distribution (a), for 
Pt(-Z') < 25 GeV/c, illustrates the turn-on curve for the 8 GeV/c threshold on the px of the leading jet, pT(jetl) > 

8 GeV/c. 
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FIG. 4: a.) The average px-balance as a function of pt(Z). b.) The ratio of predicted and measured distributions in 
PT-balance. The predicted distribution is for the combination of quark and gluon jets given by pythia. The jets are 
clustered using a cone radius of R=0.4. The pT-balance is noticeably distorted by the cut-off on the minimum px of 
the leading jet and a finite jet energy resolution in events with pt(^) < 25 GeV/ c. These events will not be used 

for the study of the predicted jet energy. 



tree-level diagrams. The px-balance observed in data is different than that from pythia simulations as shown in 
Figs. II El and El 

We test that the discrepancy between data and predictions in the px-balance is not caused by an incorrectly 
modeled fraction of quark and gluon jets using two methods. We compare rapidity distributions for Z+jet events in 
Section [VI Al to validate the relative contributions from qg — > Zq and qq — > Zg LO diagrams in pythia and ALPGEN. 
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FIG. 5: a.) The average px-balance as a function of pr(Z). b.) The ratio of predicted and measured distributions in 
Px-balance. The predicted distribution is for the combination of quark and gluon jets given by pythia. The jets are 
clustered using a cone radius of R=0.7. The px-balance is noticeably distorted by the cut-off on the minimum px of 
the leading jet and a finite jet energy resolution in events with pt(^) < 25 GeV/ c. These events will not be used 

for the study of the predicted jet energy. 



N 



1.25 
1.2 

1.15 
1.1 



Ol 

P 1.05 
S 1 



V 



0.95 
0.9 
0.85 



Z+jet, jet cone = 1 .0 
-»- Data - 4.6 fb" 1 
— - Pythia 
— »— Pythia, quark-jet 
— e— Pythia, gluon-jet 




P T (Z) [GeV/c] 

(a) 



10 2 



1.1 

1.08 
ra 106 

(0 

Q 1.04 
w 

O 1.02 
o 

=5 1 

<D 
i_ 

°- 0.98 



0.96 



0.94 



Z+jet, jet cone = 1 .0 



_L 



20 40 60 80 100 120 

P T (Z) [GeV/c] 

(b) 



140 160 



FIG. 6: a.) The average px-balance as a function of p-r(Z). b.) The ratio of predicted and measured distributions in 
Px-balance. The predicted distribution is for the combination of quark and gluon jets given by pythia. The jets are 
clustered using a cone radius of R=1.0. The px-balance is noticeably distorted by the cut-off on the minimum px of 
the leading jet and a finite jet energy resolution in events with pt(^) < 25 GeV/ c. These events will not be used 

for the study of the predicted jet energy. 



Having the rapidity distributions for data, ALPGEN, and PYTHIA in good agreement, we further test the prediction 
from pythia alone by looking at the number of tracks inside the jet cone (see Section IVl Bp . 

The quark-gluon mixture is sensitive to PDFs, choice of renormalization and factorization scales, and LO matrix 
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element for the Z + 1-parton final state. The impact from these will be reported later in Sections IVII[ IV1II1 IIX1 
andH 



A. Kinematic properties of Z+jet events 



The kinematic properties of Z+jet events provide an indirect test of the quark-gluon composition of the leading jet. 
SM predictions are studied using the distributions of sum and difference of rapidities of a Z boson and the leading jet, 
\y{Z) +r)(jetl)\ and \y{Z) —r](jetl)\, respectively (see Fig.[7J. We require pt{Z) > 15 GeV/c to avoid very soft jets. 
We observe good agreement between data and the predictions (both ALPGEN and PYTHIA). The distributions are 
slightly different for ALPGEN and pythia due to the different methods of computing MEs for the tree-level processes. 




FIG. 7: The rapidity distributions for the Z+jet system. The jet clustering is performed with a cone of R=0.4. 
Figure (a) is difference of the Z and jet rapidities; Figure (b) is the sum. 



B. Charged particle multiplicity in jets 



We perform a direct test of the quark-gluon composition of the observed jets by using the number of tracks observed 
within the cone of the leading jet. The tracks are required to originate from the same vertex as the lepton pair forming 
a Z boson, \zo — z tra ck\ < 4 cm and \do(tracks)\ < 0.02 cm with SVX hits (0.2 cm without SVX). Also we require 
a good track quality for the tracks, namely \ 2 f° r t ne track fit divided by the number of degrees of freedom should 
be less than 6. The transverse momentum of the tracks is required to be greater than 0.3 GeV/c. The number of 
tracks is different for quark and gluon jets (see Fig. [8]). Overall, the observed events are in good agreement with the 
SM predictions (pythia). 

Both tests of the quark-gluon composition of jets have demonstrated that pythia simulations describes the jet 
composition accurately. From now we proceed with investigating the other sources of uncertainties, which contribute 
to the discrepancy in px-balance between data and pythia (Figs.[U[5j and [()])■ 



VII. UNCERTAINTY DUE TO CALCULATION OF THE MATRIX ELEMENTS AND THE 

JET-PARTON MATCHING SCHEME 



Calculation of matrix elements (MEs) for Z+0 partons and Z+l parton is different in ALPGEN+pythia and 
stand-alone pythia. In addition, ALPGEN+PYTHIA simulation uses a veto algorithm to avoid double counting of 
jets produced by matrix elements and radiation from the parton shower Q. The difference in methods affects the 
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FIG. 8: a) The average number of tracks within a jet cone of radius of R=0.4 as a function of pt(Z). b) The ratio of 
the predicted number of tracks to the measured number in data versus pt(-Z). The solid band represents a 3% 

uncertainty on the predicted tracking efficiency [29]. 



Px-balance obtained using the two event generators. To estimate the difference we calculate the px-balance as a 
function of pt(-Z) in both the ALPGEN and PYTHIA samples similarly to that shown in Figs. SI [5j and [5] Then we 
calculate the ratio between the distributions as a function of pt(Z). The mean value of the ratio for events with 
Pt(Z) > 25 GeV/c is taken as the uncertainty due to calculation of MEs and jet-parton matching. The resulting 
uncertainty is found to be about 1% as shown in Table HI 



VIII. PDF UNCERTAINTIES 

We test the sensitivity of the px-balance to the choice of the PDF set used to generate events [3(| . The default PDF 
set, CTEQ5L, is a single set of functions and it does not contain error functions [111]. We estimate the PDF-related 
systematic uncertainty using the Hessian method; we take the CTEQ6M set, which includes 40 eigenvector error 
functions in addition to the central value. We re-weight the existing events with the parton densities provided by 
the CTEQ6M set. For each error set we calculate the difference in px-balance relative to the central value given by 
CTEQ6M. The sum in quadratures of the 40 variations in px-balance results in a negligibly small value (less that 
0.1%) of the uncertainty on the predicted balance. 



IX. ISR UNCERTAINTIES 



In the Monte Carlo sample of inclusive Z events generated using pythia, most of the jets arise from the ISR parton 
shower model (the underlying event model also contributes to production of soft jets), which depends upon several 
parameters that are not tightly constrained by data. As a result, the parameters of ISR affect the observed px-balance 
in Z+jet events. 

To produce the systematics samples we altered the ISR settings used in pythia, parameters PARP(61) and 
PARP(64), similarly to the procedure described in (32[. The parameter PARP(61) defines Aqcd used in running 
of a s in space-like parton showers. By default, the value of Aqcd is chosen according to the PDF parameterizations. 
Parameter PARP(64) is also used in the calculation of a s and parton distributions as a multiplier for the squared 
transverse momentum evolution scale, k\. 

The variations in px-balance calculated for the systematics samples relative to the default are used to estimate the 
ISR uncertainty. The resulting value is on the order of 1%. 
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X. UNCERTAINTY DUE TO RENORMALIZATION AND FACTORIZATION SCALES 



Predictions for Z+jet production are sensitive to the choice of renormalization and factorization scales. The 
accuracy of LO calculations is often estimated by varying renormalization and factorization scales (NLO calculations 
demonstrate weaker dependence on the scales) [33j . The scales impact calculation of the LO matrix elements for 
events with a Z boson and N partons. We exploit ALPGEN to generate events with altered ren ormalization and 
factorization scales. The scales are always kept the same by ALPGEN, with a default value Qo = yMf + T,pj,{jet). 
We produce two "systematic" samples with Qo being multiplied by 0.5 or 2.0. The choice of the scales impacts the 
predicted px-balance by up to 1-2% as recorded in Table HI 

The effect due to variation of renormalization and factorization scales in ALPGEN is similar to that caused by 
variation of the ISR parameters in pythia (see Sec. I Dtp . The stand-alone pythia method of calculations for the 
inclusive production of Z bosons uses an ISR parton shower to estimate the LO matrix element for Z plus one parton. 
Both methods deal with the same computational uncertainty caused by limited accuracy of the LO calculations used in 
the two MC event generators so that the variation of scales results in similar uncertainties for the both generators. We 
use the variation in px-balance estimated with ALPGEN and not that obtained with pythia to avoid double-counting 
the effect. The resulting effect on the predicted px-balance is about 1-2% as recorded in Table HI 



XI. CHARACTERISTICS OF OUT-OF-CONE RADIATION 



An understanding of the energy flow outside of the cone of the leading jet is essential for interpreting the measure- 
ment of px-balance in Z-jet events. Raw calorimeter energy summed in annuli outside of the jet cone is not linearly 
proportional to the true out-of-cone energy. 

We exploit correlations between px-balance and properties of the sub-leading jet such as px(jet2) and A<fi(jetl — 
jet2). Multiple pp interactions produce additional soft jets that are unrelated to the jets recoiling against the Z-boson. 
The presence of additional interactions might bias the correlation between the px-balance and the properties of the 
sub- leading jet. In the method described in this section we require all events to have exactly one primary vertex, i.e. 
one interaction. The requirement to have exactly one primary vertex was not applied before since we did not have to 
measure jet clusters with px softer than 8 GeV/ c. 

We measure the dependence of the px-balance on the azimuthal angle between the leading jet (jetl) and the sub- 
leading one (jei2), A(j>(jetl - jet2), for events with px(-Z') > 25 GeV/c (see Figs. H [10l and[TT|). The observed 
dependence of the px-balance on the angular separation between the 1st and the 2nd jets is sensitive to the jet cone 
size. As expected the largest discrepancy between data and predictions is observed with the smallest jet cone size of 
R=0.4 (see Fig. [9]), for which the particle-jet-energy to parton-jet-energy correction is at maximum. Larger jet cone 
sizes give result in better agreement between data and PYTHIA simulations. Two cone-0.4 jets can be reconstructed 
as one cone-1.0 jet. As a result the large-angle radiation most likely falls within the jet cone of R=1.0. 

The positive correlation between the px-balance and A0(jeil — jet2) shows that the 2nd jet is often caused by the 
parton radiation from the leading jet as the magnitude of the correlation is proportional to the rate of the large-angle 
parton radiation. A negative slope of the ratio between data and predictions (e.g. see Fig. [9^b) ) indicates that 
the data exhibit more large-angle parton radiation than the MC simulations. The deficiency of large-angle parton 



radiation is also consistent with the results from 34 [ 



We measure the dependence of the px-balance on the px of the second jet, px(jet2). The balance as a function of 
the 2nd jet px is shown in Figs.[T21[T31 and [TH for different sizes of jet cones. The balance is most sensitive to the 2nd 
jet px when the jet cone is 0.4 (see Fig. [T2"|) . The observed rate of large-angle out-of-cone radiation is also observed 
to be higher in data than in the predictions. 



A. Uncertainty due to out-of-cone radiation 



We use the px-cutoff of the sub-leading jet to estimate the variation of the balance due to the out-of-cone radiation. 
The agreement between data and predictions improves as we decrease the cut-off value on px(jei2) as shown in 
Figs. Q21 [T3l and [14] (all the other selection requirements are kept the same). The extrapolation to the point where 
Px(jei2) is zero describes the case where both data and predictions do not have any large-angle FSR. 

We use the ratio of px-balance between data and prediction to estimate the uncertainty due to large-angle FSR. 
The ratio is obtained as a function of px of the sub- leading jet, px(je£2). To obtain the systematic uncertainty we 
multiplied the mean value of pxG/ei2) by the slope of the linear fit of the ratio. The uncertainties are summarized 
in Table H] As an alternative approach, one can take the difference between the mean value of the ratio and that 
obtained using the linear extrapolation to pxG/e£2) = GeV/ c as a systematic uncertainty. The latter approach gives 
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FIG. 9: a.) A comparison of the measured (square markers) and predicted (triangle markers) px-balance as a 
function of A(f>(jet\ — jet2) for jets of R=0.4 cone size. The predicted balance is obtained with pythia. The events 

are required to have an only one interaction per event, b.) The fit of the ratio to a line results in x 2 /NDF = 
10.4/14 and slope = -8.12-10~ 4 ± 1.96T0 -4 , as could be explained by an inadequate modeling of large-angle parton 

radiation. 
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FIG. 10: a.) A comparison of the measured (square markers) and predicted (triangle markers) px-balance as a 
function of A<f>(jetl — jet2) for jets of R=0.7 cone size. The predicted balance is obtained with pythia The events 
are required to have an only one interaction per event, b.) The fit of the ratio to a line results in y 2 /NDF — 7.0/14 

and slope = -6.9-10" 4 ± 1.9-10~ 4 . 



different values for the uncertainty than the first one ( -2.0%, +0.4%, and 2.4% for the jet cone sizes of 0.4, 0.7, and 
1.0, respectively). 
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FIG. 11: a.) A comparison of the measured (square markers) and predicted (triangle markers) px-balance as a 
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FIG. 12: a.) A comparison of the measured (square makers) and predicted (triangle markers) px-balance as a 
function of the 2nd jet px for jets of 11=0.4 cone size. The predicted balance is obtained with PYTHIA. The events 
are required to have only one interaction per event, b.) The ratio of predicted to measured px-balance versus the 
px of the second jet. The linear fit of the ratio resulted in a slope of 0.78±0.48 %/ GeV/ c. 



XII. FINAL STATE RADIATION UNCERTAINTIES 



Final state radiation (FSR) from unfragmented partons is modeled by time-like parton showers in pythia. The 
predicted px-balance is sensitive to the rate of FSR. In this study, the variation of FSR parameters is performed 
similarly to that for ISR (see Section HXl and [32j). To produce the systematics samples we altered the parameters 
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function of the 2nd jet px for jets of R=0.7 cone size. The predicted balance is obtained with pythia. The events 
are required to have only one interaction per event, b.) The ratio of predicted to measured px-balance versus the 
Pt of the second jet. The linear fit of the ratio resulted in a slope of 0.02±0.4 %/ GeV/ c. 
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FIG. 14: a.) A comparison of the measured (square markers) and predicted (triangle markers) px-balance as a 
function of the 2nd jet px for jets of R=1.0 cone size. The predicted balance is obtained with pythia. The events 
are required to have only one interaction per event, b.) The ratio of predicted to measured px-balance versus the 
px of the second jet. The linear fit of the ratio resulted in a slope of -0.47±0.42 %/ GeV/c. 



PARP(72) and PARP(71) in pythia. The value of PARP(72) defines the parameter Aqcd used in simulation of 
time-like showers of partons produced in ISR. By default, the same Aqcd parameter from the PDF set (CTEQ5L 
in this case) is used for ISR and FSR partons. The value of PARP(71) is used as a multiplicative scaling factor to 
the Q 2 scale of the hard scattering to define the maximum parton virtuality for time-like showers. The variation of 
PT-balance in the FSR systematics samples is about 0.1-0.4%, as shown in Table HI 
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XIII. MULTIPLE PROTON-PROTON INTERACTIONS 

The px-balance is sensitive to multiple pp interactions overlapping in-time with the hard process. The number 
of interactions per event is estimated by observing additional primary vertices along the beam-line. The additional 
interactions are likely to be minimum-bias collisions. 

The uncertainty in the px-balance arises from the present limited ability of PYTHIA to describe the calorimeter 
energy in a minimum bias event accurately. The momentum distribution of charged particles in the predictions, 
measured in the magnetic spectrometer (COT), was tuned to data only for particles with px> 0.5 GeV/ c [35| as soft 
charged particles curl up in the magnetic field and do not reach the calorimeters. Consequently the response for soft 
(pt< 0.5) neutral hadrons is not measured. The px-balance varies to the number of primary verticies differently in 
data and predictions. The difference in variation of the px-balance, on the order of a percent, is taken as a systematic 
uncertainty (see Table QJ. 



XIV. DETECTOR SIMULATIONS FOR THE JET ENERGY RESPONSE 

The jet energy scale relies on measurements of the response of the calorimeters to single charged particles whose 
momenta are measured precisely using the magnetic spectrometer of CDF as well as test-beam measurements at the 
highest momenta @- Uncertainties in transferring the measured single-particle response to a parametric model of 
the calorimeter jet response contribute significantly to the uncertainty on the CDF jet energy scale. The net effect of 
varying the single particle response on the simulated px-balance is typically 2.5% as shown in Tabic fl] 



XV. SUMMARY OF SYSTEMATIC UNCERTAINTIES 

We observe a significant discrepancy in the px-balance of Z-bosons and single jets between measurements and 
prediction, with the data being lower in the px of the jet compared to predictions. We find that pythia predicts the 
fractions of quark and gluon jets accurately. In Table Q] we summarize the estimated variations (taken as systematic 
uncertainties) of the predicted balance. The totality of the variations is comparable to the observed discrepancy, with 
the largest contributions being those from large-angle FSR and the modeling of the single-particle response of the 
calorimeters. 



Source of uncertainty 


R = 0.4 


R = 0.7 


R = 1.0 


renormalization and factorization scales 


+0.9 -0.0 


+0.9 -0.4 


±0.4 


FSR parameters in pythia 


±0.4 


±0.1 


±0.1 


MEs and jet-parton matching 


+0.8 -0.0 


+1.1 -0.0 


±0.8 -0.0 


detector simulations (single particle response) 


±2.5 


±2.5 


±2.5 


multiple proton interactions 


+1.0 -0.0 


+1.2 -0.0 


±1.2 -0.0 


large- angle FSR (limitation of PS) 


+0.0 -2.9 


+0.0 -0.2 


±1.7 -0.0 


Estimate of the total variation 


+3.0 -3.8 


+3.1 -2.5 


±3.4 -2.5 


The observed discrepancy 


+4.7 


+3.2 


±2.0 



TABLE I: The effect on the predicted mean px-balance of varying parameters in the modeling and event selection, 
in percent, for jet cone sizes R = 0.4, 0.7, and 1.0. The variations are evaluated for PYTHIA events with px(-Z) > 

25 GeV/ c. The observed discrepancy is defined as the px-balance in predictions divided by that in data; the 
predicted jet energies are higher than those in data. The discrepancy between data and predictions is comparable 
with the estimate of the total variation of the predictions. A positive variation in the predicted px-balance 
corresponds to an increase in the jet energies in the MC predictions. The total variation is calculated by adding the 

uncertainties in quadrature. 

Data indicates that the large-angle parton radiation is not modeled accurately by the leading-log parton showering 
from pythia Tune AW: the MC simulation underestimates the large-angle radiation. The discrepancy is not addressed 
by the predictions from ALPGEN ± pythia either; the ME calculations affect only jets with px above the matching 
scale (15 GeV/c in the analysis); softer radiation (px(jet2) < 8 GeV/c in the analysis) is simulated via the parton 
showering mechanism. 
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XVI. CONCLUSIONS 



We have estimated the sensitivity of the px-balance method to a number of theoretical factors: the virtuality- 
ordered parton showering from pythia, tree-level matrix elements, parton distribution functions, parton-jet matching 
procedure, renormalization and factorization scales, multiple pp interactions, and calorimeter response of single stable 
particles. The contribution from each source of uncertainty is presented in Table HI The uncertainty caused by 
inadequate modeling of the parton shower at large angles is found to be the largest. The sum of the uncertainties 
is consistent with the discrepancy between data and predictions in the px-balance. The remaining uncertainties are 
significantly smaller Q. 

Numerous modern higher-order MC simulations utilize leading-log parton showering from pythia 36, 37]. The 
higher-order calculations of the matrix elements are less sensitive to the choice of renormalization and factorization 
scales so that the related uncertainty should be smaller than that we evaluated. However, the uncertainty due to large- 
angle parton radiation is expected to be of the same magnitude as in the study. We encourage the LHC experiments 
to use the distributions in Figs. |H1 [TD1 [11] and [T5] [T3] HH as a systematic method for tuning the parton showering 
parameters in event generators for more accurate jet energy measurements. 
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Appendix: Lepton identification 

We use standard CDF definitions for identification (ID) of electrons and muons as described below [38[. The same 
lepton ID requirements are applied to events from data and Monte Carlo simulations. 

The identification and triggering efficiencies for leptons are different for events in data and Monte Carlo, although 
they demonstrate a very similar energy dependence. To eliminate this inconsistency we follow the standard CDF 
practice of using correction factors ( "scale factors" ) to re- weight the MC events (see Section [3]) . 

In order to maintain a high efficiency for Z bosons, for which we require two identified leptons, we define "tight" 
and "loose" selection criteria for both electrons and muons, as described below. 

To reduce backgrounds from the decays of hadrons produced in j ets, leptons are required to be "isolated" . The 
Et deposited in the calorimeter towers in a cone in r\ — ip space [lOj of radius R = 0.4 around the lepton position is 
summed, and the Et due to the lepton is subtracted. The remaining E^ is required to be less than 10% of the lepton 
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Et for electrons or px for rations. 



1. Electron selection 



An electron candidate passing the "tight" selection must be central with Et > 20 GeV, and have: a) a high quality 
track [39] with px > 0.5 -£x or px > 50 GeV/ c; b) a good transverse shower profile at shower maximum that matches 
the extrapolated track position; c) a lateral sharing of energy in the two calorimeter towers containing the electron 
shower consistent with that expected; and d) minimal leakage into the hadron calorimeter po| . 

Additional central electrons, classified as "loose" electrons, are required to satisfy the "tight" central electron 
criteria but with a track requirement of pt > 10 GeV/c (rather than 0.5 • -Et), and no requirement on a shower 
maximum measurement or lateral energy sharing between calorimeter towers. Electrons in the forward calorimeters 
(1.2 < \rj\ < 2.5), also classified as "loose" electrons, are required to have Et > 20 GeV, minimal leakage into the 
hadron calorimeter, a track containing at least 3 hits in the silicon tracking system, and a shower transverse shape 
consistent with that expected, with a centroid close to the extrapolated position of the track 41 1. 



2. Muon selection 

A muon candidate passing the tight cuts must have: a) a well measured track in the COT 1431 with px > 20 GeV/ c; 
b) energy deposited in the calorimeter consistent with expectations [4j|- c) a muon "stub" [44| in both the CMU and 
CMP, or in the CMX, consistent with the extrapolated COT track [45|]; and d) a COT track fit consistent with an 
outgoing particle from a pp collision and not from an incoming cosmic ray [46[ . 

Additional muons, classified as "loose" , are required to have px > 20 GeV/ c and to satisfy the same criteria as 
for tight muons but with relaxed COT track quality requirements. Alternatively, for muons outside the muon system 
fiducial volume, a loose muon must satisfy the tight muon criteria and an additional more stringent requirement on 
track quality, but the requirement that there be a matching "stub" in the muon systems is dropped. 



3. Corrections due to Modeling of Electrons and Muons 

Following the standard treatment of lepton efficiencies in CDF, we re- weight Monte Carlo events to take into account 
the difference between the identification efficiencies measured in leptonic Z decays and those used in simulation [47} . 
We then make additional corrections for the difference in trigger efficiencies in simulated events and measured in data. 
Corrections to trigger efficiencies are typically 4% for trigger electrons, 8% for trigger muons that traverse both the 
CMU and CMP systems, and 5% for muons in the CMX system. The average weight for Z — > e + e~ events is 0.939; 
for Z — > /i + /J~ events it is 0.891. 

We correct the energy of electrons and muons the same way as it was done for the measurement of the W boson 
mass [481 ] . The relative positions of the tracker wires were aligned using cosmic muons. Additional track- level 
corrections were derived using W — > ev data to reduce bias between positive and negative particles. The electron 
energy was corrected in data for effects due to tower position and time (aging). 



